PRIME NUMBER RACES 



Andrew Granville and Greg Martin 



There's nothing quite hke a day at the races.... The quickening of the pulse as the 
starter's pistol sounds, the thrill when your favorite contestant speeds out into the lead 
(or the distress if another contestant dashes out ahead of yours), and the accompanying 
fear (or hope) that the leader might change. And what if the race is a marathon? Maybe 
one of the contestants will be far stronger than the others, taking the lead and running at 
the head of the pack for the whole race. Or perhaps the race will be more dramatic, with 
the lead changing again and again for as long as one cares to watch. 

Our race involves the odd prime numbers, divided into two teams depending on the 
remainder when they are divided by 4: 

Mod 4 Race, Team 3: 03, 07, 11, 19, 23, 31, 43, 47, 59, 67, 71, 79, 83, ... 

Mod 4 Race, Team 1: 05, 13, 17, 29, 37, 41, 53, 61, 73, 89, 93, 97, ... 

In this Mod 4 Race^, Team 3 contains the primes of the form 4n + 3, and Team 1 contains 
the primes of the form 4?7. + 1. The Mod 4 Race has just two contestants and is quite some 
marathon since it goes on forever! From the data above it appears that Team 3 is always 
in the lead; that is, up to any given point, there seem to be at least as many primes of the 
form 4n + 3 as there are primes of the form 4n + 1. Further data seems to confirm our 



^The integers in the arithmetic progression qn+a are often called the integers "congruent to a (mod q)" , 
and thus we have the "Mod q Race" . 
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initial observations: 
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X 


4t7 -I- 3 ITn to T 


An -\- 1 nn f,n t 


100 


13 


11 


200 


24 


21 


300 


32 


29 


400 


40 


37 


500 


50 


44 


600 


57 


51 


700 


65 


59 


800 


71 


67 


900 


79 


74 


1000 


87 


80 


2000 


155 


147 


3000 


218 


211 


4000 


280 


269 


5000 


339 


329 


ouuu 


oyy 


ooo 


7000 


457 


442 


8000 


507 


499 


9000 


562 


554 


10,000 


619 


609 


20,000 


1136 


1125 


50,000 


2583 


2549 


100,000 


4808 


4783 



Table 1. The number of primes of the form 4n + 1 and 4n + 3 up to x. 

Even in this extended data, the race remains close but Team 3 always seems to maintain 
a narrow lead. This phenomenon was first observed in a letter written by Tchebychev to 
M. Fuss on 23rd March 1853: 

" There is a notable difference in tie splitting of the prime numbers 
between the two forms An + 3, 4n + 1: 
The first form contains a lot more than the second. " 

This bias is perhaps unexpected in light of an important result in analytic number theory 
known as "the prime number theorem for arithmetic progressions" . This theorem tells us 
that for any given modulus q, the primes tend to be equally split amongst the various 
forms qn + a where^ gcd(a, g) = 1. More precisely, we know that for two such eligible 



^Note that this restriction is necessary, since every integer of the form qn + a is divisible by the greatest 
common divisor of a and q, hence cannot be prime (except possibly for a single value n) if gcd(a, q) > 1. 
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values a and b, 

,^ , T^i^lprimes on + a < 

(1) —7 — : ; \ — 1 as a; ^ 00. 

#{primes qn-\-o<x) 

This limit does not help us to predict who will win the Mod q Race^. In fact this asymptotic 
result, the prime number theorem for arithmetic progressions, does not inform us about 
any of the fine details of these prime number counts, and so neither verifies nor contradicts 
our observation that Team 3 seems to be always ahead of Team 1. 

It's time to come clean: we cheated in how we presented the data in the table above. If 
we take the trouble to look at the counts of Teams 1 and 3 for every value of x, not just 
the ones listed in the table, we find that occasionally there is some drama to this race, in 
that from time-to-time Team 1 takes the lead, but then only briefly: 

Team 1 takes the lead for the first time at prime 26,861. However, since 26,863 is also 

prime. Team 3 draws equal and then takes back the lead until 

Team 1 gets ahead again at 616,841 and at various numbers until 633,798. 

Team 3 then takes back the lead until 

Team 1 gets ahead again at 12,306,137 and at various numbers until 12,382,326. 
Team 3 then takes back the lead until 

Team 1 gets ahead again at 951,784,481 and at various numbers until 952,223,506. 
Team 3 then takes back the lead until 

Team 1 gets ahead again at 6,309,280,697 and at various numbers until 6,403,150,362. 
Team 3 then takes back the lead until 

Team 1 gets ahead again at 18,465,126,217 and at various numbers until 19,033,524,538. 
Team 3 then takes back the lead and holds on to it until at least twenty billion. 

So there are, from time to time, more primes of the form 4n-|- 1 than of the form 4n-|-3, 
but this lead is held only very briefly and then relinquished for a long stretch. Nonetheless, 
given this data, one might guess that 4n -|- 1 continues taking the lead from time to time 
as we continue to watch this marathon. Indeed this is the case: 

J.E. Littlewood. (1914) There are arbitrarily large values of x for which there are more 
primes of the form An + 1 up to x than primes of the form 4n -|- 3. In fact there are 
arbitrarily large values of x for which 

1 \/x 

(2) ^{primes 4n + 1 < x} — ^{primes 4n + 3 < x} > In In In a; 

2 In a; 

At first sight, this seems to be the end of the story. However, after seeing how infre- 
quently Team 1 manages to hold on to the lead, it is hard to put to rest the suspicion that 
Team 3 is in the lead "most of the time" , that there are more primes of the form 4?i -|- 3 
up to X than there are of the form 4n + 1, despite Littlewood's result. In 1962, Knapowski 
and Turan made a conjecture that is consistent with Littlewood's result but also bears out 
Tchebychev's observation: 



^If the ratio ^{primes 4n + 3 < a;}/#{primes 4n + 1 < x} converged to a number greater than 1 as 
a; — > 00, then we would know that #{primes 4n + 3 < x} > # {primes 4n + 1 < a;} for all sufficiently large 
X, so that in the long run Team 3 would always be ahead of Team 1. If it converged to a number less than 
1, in the long run Team 1 would always be ahead of Team 3. 
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Conjecture. As X ^ oo, the percentage of integers x < X for which there are more 
primes of the form 4n+ 3 up to x than of the form 4n + 1 goes to 100%. 



This conjecture may be paraphrased as 

"Tchebychev was correct almost all of the time. 

Looking at the above data in this hght: 



X in the Maximum percentage of x < X for which there 

Range are more primes 4n + 1 < x than 4n + 3 

- 26,860 0% 

- 500,000 ^ 0.01% 

0-10^ ^ 2.6% 

10^ - 10^ ^ .6% 

10« - 10^ ^ .1% 

109 - iQio ^ 1.6% 

IQio - iQii 2.8% 



Does this persuade you that the Knapowski-Turan conjecture is likely to be true? Is the 
percentage in the right column going to as the values of x get larger? The percentages 
are evidently very low, but it is not obvious from this limited data that they are tending 
towards 0. There was a wonderful Monthly article by Richard Guy some years ago entitled 
"Tiie Law of Small Numbers" , in which Guy pointed out several fascinating phenomena 
that are "evident" for small integers yet disappear when one examines bigger integers. 
Could this be one of those phenomena? 



Another prime race: primes of the form 3n + 2 and 3n+ 1. 

There are races between sequences of primes other than that between primes of the 
forms 4n + 3 and 4n + 1. For example, the Mod 3 Race is between primes of the form 
3n + 2 and primes of the form 3n + 1. The race begins as follows: 

Mod 3 Race, Team 2: 02, 05, 11, 17, 23, 29, 41, 47, 53, 59, 71, 83, 89, ... 

Mod 3 Race, Team 1: 07, 13, 19, 31, 37, 43, 61, 67, 73, 79, 97, ... 

In this race Team 2 dashes out into an early lead which it holds on to; that is, there seem 
to always be at least as many primes of the form 3n + 2 up to a; as there are primes of the 



PRIME NUMBER RACES 



form 3n + 1. In fact Team 2 keeps in the lead up to ten million and beyond^ 



X 


Team 2 


Team 1 


100 


13 


11 


200 


24 


21 


300 


33 


28 


400 


40 


37 


500 


49 


45 


600 


58 


50 


700 


65 


59 


800 


71 


67 


900 


79 


74 


1000 


87 


80 


2000 


154 


148 


3000 


222 


207 


4000 


278 


271 


5000 


338 


330 


6000 


398 


384 


7000 


455 


444 


8000 


511 


495 


9000 


564 


552 


10000 


617 


611 


20000 


1137 


1124 



X 


Team 2 


Team 1 


30000 


1634 


1610 


40000 


2113 


2089 


50000 


2576 


2556 


60000 


3042 


3014 


70000 


3491 


3443 


80000 


3938 


3898 


90000 


4374 


4338 


100000 


4807 


4784 


200000 


8995 


8988 


300000 


13026 


12970 


400000 


16967 


16892 


500000 


20804 


20733 


600000 


24573 


24524 


700000 


28306 


28236 


800000 


32032 


31918 


900000 


35676 


35597 


1000000 


39266 


39231 


2000000 


74520 


74412 


5000000 


174322 


174190 


10000000 


332384 


332194 



Table 2. The column labelled "Team /' contains the number of primes of the 
form 3n + j up to 

Perhaps our experience with the Mod 4 Race makes you skeptical that Team 2 always 
dominates in this Mod 3 Race. If so, you are right to be skeptical because Littlewood's 
1914 paper also applies to this race: There are arbitrarily large values of x for which there 
are more primes of the form 3n + 1 up to x than primes of the form 3n + 2. (An inequality 
analogous to (2) also holds for this race.) Therefore Team 1 takes the lead from Team 2 
infinitely often, but where is the first such value for xl We know it is beyond ten million, 
and in fact Team 1 first takes the lead at 

X = 608,981,813,029. 

This was discovered on Christmas day 1976 by Bays and Hudson. It seems that Team 2 
dominates in the Mod 3 Race even more than Team 3 dominates in the Mod 4 Race! 



^And this time we didn't cheat — Team 1 does not get the lead at some intermediate value of x that we 
have not recorded in the table. 
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Another prime race: The last digit of a prime. 

Having heard that "Team 2 retains the lead..." for over a half-a-triUion consecutive 
values of you might have grown bored with your day at the races. So let's move on to 
a race in which there are more competitors, perhaps making it less likely that one will so 
dominate. One popular four- way race is between the primes that end in 1, those that end 
in 3, those that end in 7 and those that end in 9. 



Last digit: 13 7 



1 


3 


7 


9 


Last digit: 


1 


3 


7 


9 




3 


7 






101 


103 


107 


109 


11 


13 
23 


17 


19 
29 






113 


127 




31 




37 






131 




137 


139 


41 


43 
53 


47 


59 




151 




157 


149 


61 




67 








163 


167 




71 


73 
83 




79 
89 




181 


173 




179 






97 






191 


193 


197 


199 


X 


~T 


"6" 


X 


Up to 200: 


10 


12 


12 


10 



Table 3. The columns labelled "Last digit f contain the primes of the form 
lOn + J up to 100, and between 100 and 200, respectively. 

On this limited evidence it seems that the two teams in the middle lanes are usually 
ahead. However let us examine some more data before making any bets: 



X 


Last digit: 


1 


3 


7 


9 


100 




5 


7 


6 


5 


200 




10 


12 


12 


10 


500 




22 


24 


24 


23 


1000 




40 


42 


46 


38 


2000 




73 


78 


77 


73 


5000 




163 


172 


169 


163 


10,000 




306 


310 


308 


303 


20,000 




563 


569 


569 


559 


50,000 




1274 


1290 


1288 


1279 


100,000 




2387 


2402 


2411 


2390 


200,000 




4478 


4517 


4503 


4484 


500,000 




10386 


10382 


10403 


10365 


1,000,000 




19617 


19665 


19621 


19593 



Table 4. The column labelled "Last digit j" contains the number of primes of 
the form lOn + j up to x. 
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Except for a brief surge by Team 1 ahead of Team 3 around x = 500, 000, it does appear 
that the two teams in the middle lanes continue to share the lead. 

After watching these races for a while, we begin to get the sense that each race is 
somewhat predictable — not the tiny details, perhaps, but the large-scale picture. If we 
were to watch a similar race (the Mod q Race for some number q other than 4 or 3 or 10), 
we might expect that some of the teams would be much stronger overall than others. But 
how could we predict which ones, without watching for so long that everybody around us 
would already be placing their bets on those same teams? 

Before we can hope to understand which team has the most primes most often, we need 
to appreciate how those prime counts behave in the first place. And before we can hope to 
understand the number of primes up to x of the form qn + a (for given a and q), we should 
start by knowing how many primes there are up to x. This was perhaps the single most 
important question in 19th-century number theory, a question which continued to engage 
researchers throughout the 20th century and is still jealously guarding most of its secrets 
from us today. 



So WHAT DO WE KNOW ABOUT THE COUNT OF PRIMES UP TO X ANYWAY? 

Although questions in number theory were not always mathematically en vogue, by the 
middle of the 19th century the problem of counting primes had attracted the attention of 
well-respected mathematicians such as Legendre, Tchebychev, and the prodigious Gauss. 
Although the best rigorous results of this time were due to Tchebychev, the prediction of 
Gauss eventually led to a better understanding of prime number races. 

A query about the frequency with which primes occur elicited the following response: 

"I pondered this problem as a boy, and determined that, at around x, the 
primes occur with density l/lnx." — C.F. Gauss (24th Dec 1849), letter to Encke 

This remark of Gauss can be interpreted as predicting that 



Let us compare Gauss's prediction^ with the most recent count of the number of primes 
up to X. (We shall wrote "Overcount" to denote Li(x) — n{x) rounded down to the nearest 
integer, which is the difference between Gauss's prediction Li{x) and the function 7r{x), 
the true number of primes up to x.) 



In fact, Euler made a similar though less well-known "prediction" some years before Gauss. 
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Jb 




I \\T(^'Vf*r\'\ in"!"* I.iio^l ■tt (nrx 

wVdUUU-liL. J_JilX ( /I IX 1 


108 


5761455 


753 


10^ 


50847534 


1700 


lO^o 


455052511 


3103 


10^1 


4118054813 


11587 


10^2 


37607912018 


38262 


1013 


346065536839 


108970 


1014 


3204941750802 


314889 


1015 


29844570422669 


1052618 


IQie 


279238341033925 


3214631 


1017 


2623557157654233 


7956588 


10i« 


24739954287740860 


21949554 


1019 


234057667276344607 


99877774 


1020 


2220819602560918840 


222744643 


1021 


21127269486018731928 


597394253 


1022 


201467286689315906290 


1932355207 



Table 5. Primes up to various a;, and the overcount in Gauss's prediction. 

We can make several observations from these overcounts: First notice that the width 
of the last column is always approximately half the width of the middle column. In other 
words the overcount seems to be about the square root of the main term 'k{x) itself. Also 
the error term seems to always be positive, and so we might guess that 

< Li(a;) - 7r(a;) < \fn{x) 

for all X. In fact the overcount appears to be monotonically increasing which suggests that 
we might be able to better approximate tt{x) by subtracting some smooth secondary term 
from the approximating function Li(a;), a question we shall return to later. But, for now, 
let's get back to the races .... 

The function Li(x) does not count primes, but it does seem to stay close to 7r(x), always 
remaining just in the lead. So for the race between Li(a;) and 7r(x), we can ask: will Li(a;) 
retain the lead forever? Littlewood's amazing result applies here too: 

Littlewood. (1914) There are arbitrarily large values of x for which 7r(x) > Li(x), that 
is, for which 



^{primes < x} > 




So what is the smallest xi for which 7r(xi) > Li(xi)? Skewes obtained an upper bound 
for xi from Littlewood's proof, though not a particularly accessible bound. He proved: 

Skewes (1933): xi < 10i°'° , 
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and to do even that he needed to make a significant assumption. Skewes assumed the "Rie- 
mann Hypothesis" , a conjecture that we shall discuss a little later. For a long time, this 
"Skewes number" was known as the largest number to which any "interesting" mathemat- 
ical meaning could be ascribed. Skewes later gave an upper bound that did not depend on 
any unproved assumption, though at the cost of making the numerical estimate marginally 
more monstrous, and several improvements have been made since then: 



As we shall discuss later, Bays and Hudson give compelling reasons to believe that xi 
is actually some integer close to 1.3982 x 10^^^. This is an extraordinary claim! We 
can only compute prime counts up to around x = 10^^ with our current technology and 
algorithms, and no significant improvements to this are foreseen. So how can they make 
such a prediction of the enormous value of xi, when this prediction is so far beyond the 
point where we can ever hope to compute 7r(x) directly? 

We shall explain this later. One thought though: If the first number x for which 7t{x) 
pulls ahead of Li(a;) is so large then surely the numbers x for which 7v{x) > Li (a;) are even 
scarcer than the corresponding "underdog" numbers in the races we examined earlier! 

Accurately estimating the count of primes. 

Up to the middle of the nineteenth century, every approach to estimating 7r{x) = 
#{primes < x} was relatively direct, based upon elementary number theory and combina- 
torial principles, or the theory of quadratic forms. In 1859, however, the great geometer 
Riemann took up the challenge of counting the primes in a very different way. He wrote 
only one paper that could be called number theory, but that one short memoir has had 
an impact lasting nearly a century and a half already, and its ideas helped to define the 
subject we now call analytic number theory. 

Riemann's memoir described a surprising approach to the problem, an approach using 
the theory of complex analysis, which was at that time still very much a developing sub- 
ject^. This new approach of Riemann's seemed to stray far away from the realm in which 
the original problem lived. However, it had two key features: 

• it was a potentially practical way to settle the question once and for all; 

• it made predictions that were similar, though not identical, to the Gauss prediction. 
Indeed, as we shall see, it suggested a secondary term to compensate somewhat for the 
overcount we saw in the data above. 

Riemann's method is too complicated to describe in its entirety here, but we shall take 
from it what we need to better understand the prime count 7t{x). To begin, we take the 

^Indeed, Riemann's memoir on this number-theoretic problem was a significant factor in the develop- 
ment of the theory of analytic functions, notably their global aspects. 



Skewes (1955) 
Lehman (1966) 
te Riele (1987) 




xi<2x 10"^^ 
xi < 6.658 X 10' 



Bays and Hudson (1999) 



xi < 1.3982 X 10' 



,316 
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key prediction from Riemann's memoir and restate it in entirely elementary language: 

lcm[l, 2, 3, • • • , a;] ~ as a; — >• oo. 

Now, one can easily verify that 

(n^^) ^ ( n^') ^ ( n^') x---=icm[i,2,3,---,x], 



since the power of any prime p that divides the integer on either side of the equation is 
precisely the largest power of p not exceeding x. Combining this identity with the preceding 
approximation and taking natural logarithms of both sides, we obtain 

Inp j + ( Inp j + ( Inp J + • • • f=:i a; as a; — > oo. 

p<x ' p'^<x p^<x 



Notice that the primes in the first sum are precisely the primes counted by 7r(a;), the 
primes in the second sum are precisely the primes counted by n{x^/'^), and so on. A 
technique called partial summation allows us to "take care of" the Inp summand (which 
is analogous to using integration by parts to take care of a In a; factor in an integrand). 
When partial summation is applied to the above approximation, the result is 

rli 

TTix) + i7r(a;V2) + i^(^i/3) + £^ = Li(a;). 

If we "solve for n{x)" in an appropriate way, we find the equivalent form 

7t{x) ^ U(x) - |Li(a;^/^) + . . . . 



Hence Riemann's method yields the same prediction as Gauss's prediction, and it yields 
something extra as well — namely, it predicts a secondary term that will hopefully compen- 
sate for the overcount that we witnessed in the Gauss prediction. Let's review the data and 
sec how Riemann's prediction fares. "Riemann's overcount" refers to Li(a;) — |Li(y^) — 
7r(a;), while "Gauss's overcount" refers to Li(a;) — n{x) as before: 
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10^ 


5761455 


753 


131 


10^ 


50847534 


1700 


-15 


IQio 


455052511 


3103 


-1711 


IQii 


4118054813 


11587 


-2097 


10^2 


37607912018 


38262 


-1050 


1013 


346065536839 


108970 


-4944 


1014 


3204941750802 


314889 


-17569 


1015 


29844570422669 


1052618 


76456 


1016 


279238341033925 


3214631 


333527 


1017 


2623557157654233 


7956588 


-585236 


1018 


24739954287740860 


21949554 


-3475062 


1019 


234057667276344607 


99877774 


23937697 


1020 


2220819602560918840 


222744643 


-4783163 


1021 


21127269486018731928 


597394253 


-86210244 


1022 


201467286689315906290 


1932355207 


-126677992 



Table 6. Primes up to various and Gauss's and Riemann's predictions. 

Riemann's prediction does seem to be a little better than that of Gauss, although not 
a lot better. However, the fact that the error in Riemann's prediction takes both positive 
and negative values suggests that this might be about the best we can do. 

Going back to the key prediction from Riemann's memoir, we can calculate some data 
to test its accuracy: 



Nearest integer to 

X ln(lcm[l, 2, 3, • • • , a;]) difference 

100 94 -6 

1000 997 -3 

10000 10013 13 

100000 100052 52 

1000000 999587 -413 



Riemann's prediction has been made more precise over the years, and it can now be 
expressed very explicitly as 

|ln(lcm[l,2, ••■ ,x])-x\ < 2^ln x for alia; > 100. 

In fact, this inequality is equivalent^ to the infamous Riemann Hypothesis, perhaps the 
most prominent open problem in mathematics. The Riemann Hypothesis is an assertion, 
proposed by Riemann in his memoir and still unproven, about the zeros of a certain function 
from complex analysis that is intimately connected with the distribution of primes. To 
try to give some idea of what Riemann did to connect the two seemingly far-removed 
areas of number theory and complex analysis, we need to talk about writing functions as 
combinations of "waves" . 



^As is |Li(a;) — 7r(a;)| < ^/x\nx for all a; > 3. 
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Doing the wave. 

You have probably wondered what "radio waves" and "sound waves" have to do with 
waves. Sounds don't usually seem to be very "wavy", but rather are fractured, broken 
up, stopping and starting and changing abruptly. So what's the connection? The idea 
is that all sounds can be converted into a sum of waves. For example, let's imagine that 
our "sound" is simply the gradually ascending line y — x — ^, considered on the interval 
< a; < 1, which is shown in the first graph of Figure 1. 




Figure 1. The line y = x — ^ and the wave y = — ;^ sin27ra;. 

If we try to approximate this with a "wave", we can come pretty close in the middle 
of the line using the function y = sin27rx. However, as we see in the second graph of 
Figure 1, the approximation is rather poor when a; < | or x > |. 

How can we improve this approximation? The idea is to "add" a second wave to the 
first, this second wave going through two complete cycles over the interval < a; < 1 
rather than only one cycle. This corresponds to hearing the sound of the two waves at 
the same time, superimposed; mathematically, we literally add the two functions together. 
As it turns out, adding the function y — —-^ sin47ra; makes the approximation better for 
a range of x-values which is quite a bit larger than the range of good approximation we 
obtained with one wave, as we see in Figure 2. 




Figure 2. Adding the wave y = —-^ sin47ra; to the wave y = — ;^ sin27ra; 

We can continue on like this, adding more and more waves that go through 3, 4, or 
5 complete cycles in the interval and so on, to get increasingly better approximations to 
the original straight line. The approximation we get by using one hundred superimposed 
waves is really quite good, except near the endpoints and 1. 
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0.6 



0.2 



0.2 



0.4 



0.6 



0.8 



-0.2 



-0.6 



Figure 3. The sum of one hundred carefully chosen waves 



If we were to watch these one hundred approximations being built up one additional 
wave at a time, we would quickly be willing to wager that the more waves we allowed 
ourselves, the better the resulting approximation would be, perhaps becoming as accurate 
as could ever be hoped for. As long as we allow a tiny bit of error in the approximation 
(and shut our eyes to what happens very near the endpoints), we can in fact construct 
a sufficiently close approximation if we use enough waves. However, to get a "perfect" 
copy of the original straight-line, we would need to use infinitely many sine waves — more 
precisely, the ones on the right-hand side of the formula 



X — 



-2 E 



sin(27rna;) 
27rn 



which can be shown to hold for every < a; < 1. (We can read off, in the n = 1 and n = 2 
terms of this sum, the two waves that we chose for Figures 1 and 2.) This formula is not 
of much practical use, since we can't really transmit infinitely many waves at once — but 
it's a gorgeous formula nonetheless! 

In general, for any function j{x) defined on the interval < a; < 1 that is not "too 
irregular", we can find numbers a„ and 6„ such that f{x) can be written as a sum of 
trigonometric functions, namely 

j{x) = ao + {obn cos(27rna;) + hn sin(27rna;)) . 
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This formula, together with a way to calculate the coefficients a„ and b^, is one of the 
key identities from "Fourier analysis" , and it and its many generalizations are the subject 
of the field of mathematics known as harmonic analysis. In terms of waves, the numbers 
2ivn are the "frequencies" of the various component waves (controlling how fast they go 
through their cycles), while the coefficients an and bn are their "amplitudes" (controlling 
how far up and down they go). 

Riemann's revolutionary formula. 

Riemann's idea can be simply, albeit rather surprisingly, phrased in the following terms: 
Try counting the primes as a sum of waves. 
The precise formula he proposed is a bit too technical for this article, but we can get a 
good sense of it from the following approximation when x is large. This formula, while 
widely believed to be correct, has not yet been proved. 



(3) 



X 






# {primes < x} 

2 




sin (7 In a;) 


1 + 2 


y/x/ \nx 


7 


all real numbers 7>0 

such that 5+17 
is a zero of 



The numerator of the left-hand side of this formula is the error term when comparing the 
Gauss prediction Li(a;) to the actual count n{x) for the number of primes up to x. We 
saw earlier that the overcounts seemed to be roughly the size of the square root of x, so 
the denominator ^yx /Inx appears to be an appropriate thing to divide through by. The 
right side of the formula bears much in common with our formula for a; — 1/2. It is a 
sum of sine functions, with the numbers 7 employed in two different ways in place of 27rn: 
each 7 is Tised inside the sine (as the "frequency"), and the reciprocal of each 7 forms 
the coefficient of the sine (as the "amplitude"). We even get the same factor of 2 in each 
formula. However, the numbers 7 here are much more subtle than the straightforward 
numbers 27rn in the corresponding formula for x — 1/2: 
The Riemann zeta-function ({s) is defined as 

N 1 1 1 
C{s) = p + ^ + 3I + • • • • 

Here s is a complex number, which we shall write as s = a + it when we want to refer to 
its real and imaginary parts a and t separately. If s were a real number, we know from 
first-year calculus that the series in the definition of ('(■§) converges if and only if s > 1; 
that is, we can sum up the infinite series and obtain a finite, unique value. In a similar 
way, it can be shown that the series only converges for complex numbers s such that a > 1. 
But what about when cr < 1? How do we get around the fact that the series does not sum 
up (that is, converge)? 

Fortunately, there is a beautiful phenomenon in the theory of functions of a complex 
variable called "analytic continuation". It tells us that functions that are originally de- 
fined only for certain complex numbers often have a unique "sensible" definition for other 
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complex numbers. In this case, the definition of ({s) given above only works when cr > 1, 
but it turns out that "analytic continuation" allows us to define ({s) for every complex 
number s other than s = 1 (see [17] for details). 

This description of the process of analytic continuation looks disconcertingly magical. 
Fortunately, there is a quite explicit way to show how C(c + can be "sensibly" defined 
for the larger region cr > 0, at least. We start with the expression (1 — 2^~^)({s) and 
perform some sleight-of-hand manipulations: 

(l-2i-)C(.) 



Solving for C(s), we find that 

All of these manipulations were valid for complex numbers s — a + it for which a > 1. 
However, it turns out that the infinite series in curly brackets actually converges whenever 
cr > 0. Therefore, we can take this last equation as the new "sensible" definition of 
the Riemann zeta-function on this larger domain. Note that the special number s = 1 
causes the otherwise innocuous factor of 1/(1 — 2^"'^) to be undefined; the Riemann zeta- 
function intrinsically has a problem there, one that cannot be swept away with clever 
re-arrangements of the infinite series. 

Riemann's formula (3) depends on the zeros of the analytic continuation of C{s). The 
easiest zeros to identify are the negative even integers; that is 

C(-2) = 0, C(-4) = 0, C(-6) = 0, ... 

which are called the "trivial zeros" of the zeta-function. It can be shown that any other 
complex zero a + it of the zeta function (that is, a number satisfying C(c" + = 0) 
must satisfy < a < 1; these more mysterious zeros of the zeta-function are called the 
"non-trivial zeros" . 

After some calculation Riemann observed that all of the non-trivial zeros of the zeta- 
function seem to lie on the line Re(s) = 1/2. In other words, he stated the remarkable 
hypothesis: 

If cr -H is a complex number with < cr < 1 and C(cr -|- it) = 0, then a — ^. 



y--2 y -= y y 1 

n>l n>l m>l n>l 

n even odd even 

(---\ (---\ 



1* 2« j ^ V 3* 4V ^ V 5* 6* ' ^ 
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This assertion, which nobody has yet managed to prove, is the famous Riemann Hypothesis. 

If the Riemann Hypothesis is indeed true, then we can write all the non-trivial zeros 
of the zeta-function in the form p = ^ + «7 (together with their conjugates | — ^7, since 
C(l/2 + i'j) = if and only if C(l/2 ^ ^7) = 0), where 7 is a positive real number. These 
are the mysterious numbers 7 appearing in the formula (3), which holds if and only if 
the Riemann Hypothesis is true. There is a similar formula if the Riemann Hypothesis is 
false, but it is rather complicated and technically far less pleasant. The reason is that the 
coefficients I/7, which are constants in (3), get replaced by functions of x. So we want the 
Riemann Hypothesis to hold because it gives the simpler formula (3), and that formula is 
a delight. Indeed (3) is similar enough to the formulas for soundwaves for some experts to 
muse that (3) asserts that "the primes have music in them" . 

One might ask how we add up the infinite sum in (3)? Simple, add up by order of 
ascending 7 values and it will work out. 

Clever people have computed literally billions of zeros of Ci^)^ ^ind every single zero 
that has been computed does indeed sit squarely on the line a — 1/2. For example, the 
non-trivial zeros closest to the real axis are s = 1/2 + 271 and s = 1/2 — ^71, where 

71 Ri 14.1347 We believe that the positive numbers 7 occurring in the non-trivial 

zeros look more or less like random real numbers, in the sense that none of them are 
related to one another by simple linear equations with integer coefficients (or even by more 
complicated polynomial equations with algebraic numbers as coefficients). However, since 
about all we know how to do is to numerically approximate these non-trivial zeros to a 
given accuracy, we cannot say much about the precise nature of the numbers 7. 

Prime Race for 7r{x) versus Li{x). 

So, how do we use this variant on "Fourier analysis" to locate the smallest x for which 



^{primes < x} > J 

The idea is to approximate 



TV ^ ^ ^ / ifi - #{primes < x} 

^/x/lI^x y/x/lnx 

by using the formula (3). Just as we saw when approximating the line a; — 1/2, we expect 
to obtain a good approximation here simply by summing the formula on the right side of 
(3) over the first few zeros of C(s) (that is, the smallest hundred, or thousand, or million 
values of 7 with C(l/2 + ^7) = 0, depending upon the level of accuracy that we want). In 
other words, 

X 

/ 1^ - #{prinies < a;} 

2 Ril + 2 V sm(7lna:) 

^/x /lux ^ 7 ' 

^+17 is a zero of C(s) 
0<7<T 
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where we can choose T to include however many zeros we want. Figure 4 is a graph of the 
function® (Li(x) — 7r{x)) / {^Li{^/x)) as x runs from 10^ to 10®, together with three graphs 
of approximations using the first 10, 100, and 1000 values of 7 respectively. We see that 
the approximations get better the more zeros we take. 




1.6 
1 . 4 
1.2 

1 

0.8 
0.6 
0.4 
0.2 








Figure 4. A graphs of (Li(x) — 7r(a;))/(|Li(y^)), followed by approximations 
using 10, 100, and 1000 zeros of 

Extensive computations of this type led Bays and Hudson to conjecture that the first 
time that 7r{x) surpasses Li(x) is at approximately 1.3982 x 10^-*^^. 

The Mod 4 Race. 



^When X is large, ^'Li(y/x) « y/xl\a.x. We have preferred to use the latter expression because it 
consists of more familiar functions. However, when dealing with "small" values of x as in these graphs, 
using the function ^\A{yfx) lets us more clearly display the similarities we want to show. 
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In 1959, Shanks suggested studying the Mod 4 Race by drawing a histogram of the 
values of 



^{primes 4n + 3 < x} — TT^jprimes 4n + 1 < x} 

x/ Inx 



(4) 



Figure 5 displays such a histogram for the one thousand sample values x = 1000, 2000, 
3000, . . . , 10^. 



n 



Figure 5. A histogram for the values of (4) at x = lOOO/c, 1 < k < 1000 

This histogram is suggestive; one might guess that if we incorporate more and more sample 
values, then the histogram will more and more resemble something like a bell-shaped curve 
centered at 1. Littlewood's result (2) implies that the tail does stretch out horizontally 
to oo as more and more sample values are used, since the ratio in equation (4) will be at 
least as large as some constant multiple of In In Inx for infinitely many values of x.^ The 
infinite extent of the histogram is certainly not evident from the figure above, but this is 
not so surprising, since 

"In In Inx goes to infinity with great dignity." — Dan Shanks, 1959 

In (3) we saw how the difference between 7r(x) and Li(x) can be approximated by 
a sum of waves whose frequencies and amplitudes depend on the zeros of the Riemann 



^In fact, Littlewood also proved the inequality with the two terms on the left-hand side reversed, so 
that the histogram stretches out to — oo as well. 
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zeta-function. To count primes up to x of the form 4n + 1, or of the form An + 3, or 
of any form qn + a with gcd(a, q) = 1, there is a formula anafogous to (3) that depends 
on the zeros of Dirichlet L-functions, relatives of the Riemann zeta-function, which also 
have natural though slightly more complicated definitions. For example, the Dirichlet L- 
function associated to the race between primes of the form 4n + 3 and primes of the form 
4n + 1 is 

L{s) 



1 



+ 



111 

P ~ 3^ ^ 5^ ~ 7« 

Note that this (and all other Dirichlet L-functions) converges for any s = a + it with a > 0. 
There is a lovely formula, analogous to (3), for the number of primes of the form qn + a up 
to X. This formula holds if and only if all of the zeros of these Dirichlet L-functions in the 
strip < Re(s) < 1 satisfy Re(s) = 1/2. We call this assertion the Generalized Riemann 
Hypothesis. For example, if the Generalized Riemann Hypothesis is true for the function 
L(s) just defined, then we get the formula 



#{primes 4n -|- 3 < a;} — #{primes 4n -|- 1 < a;} 
^/x/ \nx 



1 + 2 



E 



sin (7 \nx) 



Real numbers 7>0 
such that 5+i7 
is a zero of L{s) 



7 



Consequently, we can try to study the Mod 4 Race by computing the right-hand side of 
this formula, truncating to involve only a hundred or a thousand or a million zeros of L(s). 
Here is a graph showing how well such an approximation with 1000 zeros agrees with the 
actual data: 





Figure 6. A graph of the function in (4), and an approximation using 1000 
zeros of L(s). 



The first three occasions that Team 1 takes the lead are clearly visible in both graphs 
(above we gave the exact values at which this happens). The convenient thing about 
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the approximation is that the approximation doesn't become hopelessly more difficult to 
compute as x becomes large, unlike determining the precise count of primes. In 1999 Bays 
and Hudson used such approximations to predict further "sign changes" in (4) for x up to 
;Lq10oo_ rpj^g ^g^^ ^^^^ ^Yiey predicted were 

fa 1.4898 X 10^^ which was found at 1,488,478,427,089; 
w 9.3190 X 10^^ 

Where do these biases come from? 

We have observed several examples of prime number races and of biases that certain 
arithmetic progressions seem to have over others. Although we have seen that such bias 
can be calculated through complicated formulae like (3), this explanation is not easily 
understood, nor does it give us a feeling as to how we might predict, without enormous 
calculation, which of two progressions typically dominates the other. Let's summarize the 
prime races we have seen so far and look for a pattern, hoping that we can find some short 
cut to making such predictions. We have seen that there seem to be, at least most of the 
time. 

More primes of the form 4n + 3 than of the form 4n + 1; 
More primes of the form 3n + 2 than of the form 3n + 1; 

More primes of the form lOn + 3 and lOn + 7 than of the form lOn + 1 and lOn + 9. 

Do you see a pattern? There's probably not enough data here to make a good guess. But 
let's check out one more four-way prime number race, this one with the forms 8n + 1, 8n + 
3, 8n + 5, 8n + 7 as the contestants: 



X 


8n+ 1 


8n + 3 


8n+ 5 


8n + 7 


IK 


37 


44 


43 


43 


2K 


68 


77 


79 


78 


5K 


161 


168 


168 


171 


lOK 


295 


311 


314 


308 


20K 


556 


571 


569 


565 


50K 


1,257 


1,295 


1,292 


1,288 


lOOK 


2,384 


2,409 


2,399 


2,399 


200K 


4,466 


4,495 


4,511 


4,511 


500K 


10,334 


10,418 


10,397 


10,388 


1 million 


19,552 


19,653 


19,623 


19,669 



Table 7. The number of primes of the form 8n + j for j = 1, 3, 5 and 7 up to 
various x. 

This is a strange race, since the only clear pattern that emerges is that there seem to be 
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More primes of the form 8n + 3, 8n + 5, and 8n + 7 than of the form 8n + 1. 

(In fact, this is true for all x between 23 and 10^.) But this "strangeness" should help 
students of number theory guess at a pattern, because such students know that all odd 
squares^° are of the form 8n + 1. 

During our discussion about lcm[l, 2, ...,n] and its connection with the prime number 
counting function tt{x), we saw that Riemann's prediction 7r(a;) + |#{primes p withp^ < x} 
is what is best approximated by and so it is the squares of primes that "account" 

for the bias that Li{x) has over 7r{x) for most of the time. 

These pieces of evidence^ ^ point to the squares of primes playing an important role in 
such biases. Let's check out the arithmetic progressions that the squares of primes belong 
to for the moduli above. 

For any prime p not dividing 4, we have p^ of the form 4n + 1; 
For any prime p not dividing 3, we have p^ of the form 3n + 1; 
For any prime p not dividing 10, we have p^ of the form lOn + 1 or 9; 
For any prime p not dividing 8, we have p^ of the form 8n + 1. 

Exactly right every time! It does seem that "typically" qn + a has fewer primes than qn + b 
if a is a square modulo q while b is not. 

What is the phenomenon that we are actually observing? 

We have been observing some phenomena where there is a "typical" bias in favour of 
one arithmetic progression over another, that is, one "usual" leader in the prime number 
race. So far, though, we have not exhibited a precise description of this bias nor defined a 
number that measures how strong the bias is. In an earlier section, we stated Knapowski 
and Turan's 1962 conjecture: if we pick a very large number x "at random" , then "almost 
certainly" there will be more primes 4n + 3 < x than 4n + 1 < x. (Obviously, such 
a conjecture can be generalized to the other prime races we considered). However, the 
supporting evidence from our data was not entirely convincing. Indeed, by studying the 
explicit formula (3), Kaczorowski (1993) and Sarnak (1994), independently, showed that 
the Knapowski-Turan conjecture is false! In fact, the quantity 

— there are more primes of the form 4n + 3 up to a; than of the form 4n + 1} 

does not tend to any limit as X ^ oo, but instead fluctuates. This is not a very satisfying 
state of affairs, though it is feasible that we can't really say anything more: it could be 
that the phenomena we have observed are only true for "small numbers", -"^^ and that the 
race looks truly "unbiased" when we go out far enough. 

However, this turns out to not be the case. In 1994, Rubinstein and Sarnak made an 
inspired observation. After determining that one progression does not dominate the other 

i°Note that (2m - 1)^ = 8(™) +1 = 1 (mod 8) for all odd integers 2m - 1. 
-"^^ admittedly somewhat circumstantial evidence 
"small" on an appropriate scale! 
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for any fixed percentage of the time, so that that precise fine of thinking seemed cfosed, 
Rubinstein and Sarnak said to themselves that perhaps the obvious way to count prime 
number races is not necessarily the correct way to count prime number races, at least in 
this context. They observed that if you count things a little differently (in technical terms, 
"if you use a different measure" ), then you get a much more satisfactory answer. Moreover, 
you get a whole panorama of results that explain all the observed phenomena in a way 
that feels very natural. 

Sometimes in mathematics it is the simplest remark that is the key to unlocking a 
mystery, not the hard technical slog, and that was certainly the case for this question. The 
key idea of Rubinstein and Sarnak is to not count 1 for each integer x < X for which there 
are more primes of the form 4n + 3 up to a; than of the form 4n + 1, but rather to count 
1/x. Of course, the total sum "^^^x l/^^ is not X, but rather approximately InX, so we 
need to scale with this instead. However, when we do so, we obtain a remarkable result: 

Rubinstein and Sarnak (1994): 



There are more primes 4n+3 
than 4n+l up to x 



.9959 ... as X ^ oo. 



In other words, with the appropriate method of measuring, Tchebychev was correct 99.59% 
of the time! 

Moreover their idea can be applied to all of the other prime number races that we have 
been discussing. For example, in the Mod 3 Race, we have 



- — > .9990 ... as X ^ oo. 



InX ^ X 

x<X 

There are more primes 3n+2 
than 3n+l up to x 

SO that Team 1 only has about one chance in a thousand of being ahead of Team 3 at any 
given time, when we measure "being ahead" this way. 

And what about the race between ti{x) and Li(x)? Remember that we don't expect to 
find a counterexample until we get out to the huge value x ~ 1.3982 x 10^^^. In this case, 
Rubinstein and Sarnak showed that 

-\- y - — > .99999973 ... as X ^ oo. 
InX ^ X 

x<X 
7r(a;)<Li(x) 

That is, with the appropriate method of measuring, -k^x) < Li(x) about 99.999973% of the 
time! No wonder that the first point at which ti{x) > Li(a;) is so far out. 

This successful way of measuring things is called the logarithmic measure (because of 
the "normalizing factor" InX). It appears in many contexts in mathematics, and was even 
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used in a related context by Wintner in 1941, but never quite in such an appealing and 
transparent way as by Rubinstein and Sarnak. 

Their wonderful results are proved under highly plausible assumptions. They assume 
first that the Generalized Riemann Hypothesis holds (in other words, marvellous formulae 
analogous to (3) can be used to count primes of the form gn + a up to x), and second that 
the 7's that arise in these formulae — the imaginary parts of the zeros of the associated 
Dirichlet L-functions — are actually linearly independent over the rational numbers. It 
would be shocking if either of these assumptions is false, but it does seem unlikely that 
either one will be proved in the near future. 

How do you prove such results? 

The proofs of the results mentioned above, and those discussed in the next sections, 
are too technical to describe in full detail here. However, they do all depend on careful 
examination of the formula (3) and its analogues for the Mod q Races. The assumption of 
the Generalized Riemann Hypothesis ensures, in all the cases under consideration, that a 
formula of the type (3) exists in the first place. To analyze its value, we need to consider 
the sum on the right side of (3) and, in particular, the values of the various terms and 
how they interact: Each term — 2sin(7lnx)/7 is a sine- wave which undulates regularly^^ 
between —2/7 and 2/7. As we have seen, when studying how such waves can combine 
to give approximations to a straight line, if the various values of 7 can be chosen so that 
somehow the individual undulations can be synchronized, then the sum can accumulate 
into surprising shapes. This leads to the second assumption: If we assume that the waves 
cannot combine in some extraordinary way, then the sum should remain small and unsur- 
prising. In other words, we would expect that a fair portion of the terms would be positive 
and a fair portion will be negative, so that there will be significant cancellation. This 
is what is achieved by assuming that the 7's are linearly independent over the rational 
numbers. Under this second assumption, the values of the sin(7lnx) terms cannot be well 
synchronized for very many values of x, and so we are able to prove results. In fact, we 
find that the smallest few 7's have the largest infiuence on the value of the sum (which is 
not surprising, since the term corresponding to 7 has a factor of 7 in the denominator). 

Calculating precise numerical values for how often one prime number count is ahead 
of another is even more delicate. It is no longer enough to argue that the sum is "large" 
or "small"; we need to know exactly how often the sum in (3) is greater than — i or 
less than — |, since that value is where the right-hand side switches between positive and 
negative. As it turns out, it is possible to calculate exactly how often this occurs if we 
use (traditional) Fourier analysis: We rewrite the sum of the waves in (3) in which the 
frequencies depend on the zeros of the Riemann zeta-function, as a sum of "Fourier waves" 
where the frequencies depend on the numbers 2iTn. Although this is too cumbersome 
to give in a closed form, one can, with clever computational techniques, approximate its 
numerical value with great accuracy. 

Squares and nonsquares: Littlewood's method. 



^"Regularly", that is, if we consider In a; to be the variable rather than x. In fact, this is really the 
reason that the "logarithmic measure" is most appropriate for our prime number counts. 
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On several occasions above we encountered consequences of Littlewood's 1914 paper-^^. 
The method is robust enough to prove some fairly general results but not typically about 
one progression racing another Indeed, for any odd prime q, let Team S be the set of 
primes that are congruent to a square mod q, and Team N be the rest of the primes. Using 
Littlewood's method one can show that each such team takes the lead infinitely often^^, 
and the lead up to x can be as large as c-y^ In In In a;/ In a; for some constant c > that 
depends only on q. One example of this is in the q = 5 race where S contains the primes 
with last digits 1 and 9 (as well as 5), so this is the Mod 10 Race again — recall that we 
observed in the limited data above that Team N typically held the lead. 

There are other races that can be analyzed by Littlewood's method, always involving 
partitioning the primes into two seemingly equal classes. In fact there is at least one such 
race for every modulus q, and often more than one. When q is not a prime, however, the 
descriptions of the appropriate classes become more complicated. 

More results from Rubinstein and Sarnak (1994). 

In a prime number race between two arithmetic progressions modulo g, when do we see 
a bias and when not? Is each arithmetic progression "in the lead" exactly 50% of the time 
(in the logarithmic measure) or not? More importantly perhaps, can we decide this before 
doing a difficult calculation? Above we saw that there seem to "usually" be more primes 
up to X of the form qn + b than of the form gn + a if a is a square modulo q and b is not. 
Indeed, under the two assumptions mentioned above, Rubinstein and Sarnak proved that 
this is true: the logarithmic measure of the set of x for which there are more primes of the 
form qn + b up to X than of the form qn + a is strictly greater than |, although always 
less than 1. In other words, any nonsquare is ahead of any square more than half the time 
though not 100% of the time. 

We can ask the same question when a and b are either both squares modulo q or both 
nonsquares modulo q. In this case, under the same assumptions, Rubinstein and Sarnak 
proved that 7^{primes qn + a < x} > ^{primes qn + b < x} exactly half the time. In 
fact they prove rather more than this. To describe their result, we need to define certain 
error terms related to these prime number counts and describe what we mean by their 
"limiting distributions". As we noted above, the values of the prime counting function 
^{primes qn + a < x} are all roughly equal as we vary over the integers a up to g that 
have no factor in common with q. In fact, we saw that the ratio of any two of them tended 
to 1 as a; — cxD. This implies that 

^{primes qn + a < x} 

^(^)70(^) ^1 aso^-oo, 

where 7r{x) = #{primes < x} as before and (f){q) is the number of positive integers a up 
to q for which gcd(a,g) = 1. We have seen that it is natural to look at the difference in 



■"■^Which actually only dealt with the tt(x) vs. Li(a;) race. 
^^Though that was all we saw in the cases above where q is small. 

^^In other words, there are arbitrarily large x and y, such that there are more primes in S than in N 
up to x; and there are more primes in N than in S up to y. 
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these quantities divided by ^/xj Inx, so let us define 

#{primes qn-\- a <x) — TT{x)/(j){q) 



Error(x; g, a) = 



y/x/ In a; 



Rubinstein and Sarnak suggested that to properly study prime races between arithmetic 
progressions a (mod q) and h (mod q), one should look at the distribution of values of the 
ordered pair 

(Error (a;; g, a), Error(a;; g, 6)) 

as X varies, this distribution again being defined with respect to the logarithmic measure. 

More concretely, given real numbers a < (3 and a' < /3', one might ask how often a < 
Error(a:; g, a) < P and a' < Ettot{x; q,b) < j3' . This frequency is defined by the limit of 
integrals 

lim I o<y<Y, -dy 

Y^oo mr J Error(2/;g,a)e[a,/3] V 
and Error(j/;g,6)G[a',/3'] 

(a limit that they prove exists). Rubinstein and Sarnak proved that if a and h are both 
squares or both nonsquares modulo g, then this distribution is symmetric, that is, 

a < Error(a;; q,a) < (3 and a' < Error(a;; g, b) < (5' 

as often as 

a' < Error(a;; g, a) < (3' and a < Error(a;; g, h) < (3. 

In other words, there is no sign of any bias, of any form, at all: the arithmetic progressions 
a (mod g) and h (mod g) are interchangeable in the limit. 

This was all studied in much more generality. For example, one can ask about the 
twenty-four possible orderings of the four prime number counts 

^{primes Sn+l < a;},#{primes 8n+3 < a;},#{primes 8n-|-5 < a;},#{primes 8n-|-7 < x}. 

Rubinstein and Sarnak showed that each ordering occurs for infinitely many values of x; in 
fact, each ordering occurs a positive proportion of the time (in the logarithmic measure). 
This can be generalized to all moduli g and to as many distinct arithmetic progressions 
ai, . . . Oj. (mod g) as we like. That is, one can study the distribution of 

(Error(x; g, ai), Error(x; g, 02), . . . , Error(x; g, a^)). 

Assuming only the Generalized Riemann Hypothesis, they prove that the distribution 
function^^ for this vector of error terms exists. Assuming also the linear independence of 



-"^^Defined by a certain limit of integrals, analogous to the limit a few lines earlier. 
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the 7's over the rational numbers, they prove that for any distinct ai, . . . (mod g), none 
of which have any factors in common with g, the ordering 

^{primes qn + ai < x} < ^{primes qn + a2 < x} < ■ ■ ■ < ^{primes gn + a^. < x} 

occurs for a positive proportion (under the logarithmic measure) of values x. They also 
proved that, for any fixed r, these proportions get increasingly close to 1/r! as g gets 
larger. It seems extremely unlikely that this proportion is usually exactly 1/r!, but we 
cannot prove that it is or isn't except in the following special situation. 

Above we saw that the distribution function is symmetric when we have a two- progression 
race and the progressions are either both squares or both nonsquares modulo g. Surpris- 
ingly, Rubinstein and Sarnak showed that there is only one other situation in which the 
distribution function is symmetric no matter how we swap the variables with one another, 
namely the race between three arithmetic progressions of the form 

a (mod g), aco (mod g), and aco'^ (mod g) 

where uj^ = 1 (mod g) but lv ^ 1 (mod g). 

However, Rubinstein and Sarnak's result that the distribution function is not usually 
symmetric still leaves open the possibility that each ordering in a race occurs with the 
same frequency 

Earlier we saw Shanks' histogram of values of Error(a;; 4, 3) — Error(a;; 4, 1) for various 
values of x, and guessed that "the histogram will more and more resemble something like 
a bell-shaped curve centered at 1" as we take more data points. One perhaps surprising 
consequence of the work of Rubinstein and Sarnak is that this most obvious guess is not 
the correct one: Although there is a distribution function, it will not be as simple and 
elegant as the classical "normal distribution" bell curve. 

What research is happening right now? 

After Littlewood's great (and, at the time, quite surprising) results in 1914, there was a 
lull in research in "comparative prime number theory" until the fifties and sixties. At that 
time, Littlewood's ideas were extended in several theoretical directions that were arguably 
suggested directly by Littlewood's work, and by substantial calculations of Shanks, Hudson, 
and others. By the nineties, it appeared that most of what could be done in this subject 
had been done (and that much would never be done), although Kaczorowski was still 
proving some new results at this time. 

In 1993, Giuliana Davidoff taught a senior-level undergraduate course at Mount Holyoke 
College in analytic number theory. Finding so many interested students. Professor Davidoff 
decided that she could run a fun ''^Research Experience for Undergraduate^^ (REU) that 
summer investigating prime number races^^. Along with students Caroline Osowski, Jen- 
nifer vanden Eynden, Yi Wang, and Nancy Wrinkle, she did some important calculations 
and proved several results that will be given in the first appendix to this article. 



Since (for example) a function can be positive half the time without being symmetric. 
Under the more formal title "Topics in comparative number theory'\ 
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By chance, Davidoff met Sarnak on summer vacation. She described her REU project 
and how they were developing the computational approach of Stark to such problems. Sar- 
nak writes^*^: "I was not familiar with this Chebyshev bias feature and became fascinated. 
In particular I was . . . very interested to put a definite number as to the probability of 
n{x) beating Li(a;)." 

Sarnak continues: "When I got back to Princeton I chatted with Fernando^^ about 
this . . . and began to work on this ... It was clear that many zeroes would have to be 
computed" . Sarnak had previously discussed other questions on the distribution of prime 
numbers with a bright undergraduate, Mike Rubinstein. "Mike, who was looking for a 
senior thesis topic, got very involved in this and after a while he and I worked on the 
problem as collaborators and this led to our paper." This was quite an extraordinary 
senior thesis, as it became one of the most influential papers in recent analytic number 
theory. Subsequently Rubinstein went on to his Ph.D.^^ and has become one of the world's 
leading researchers in the computation of difi'erent types and aspects of zeta-functions. 

Soon thereafter, the second author read Rubinstein and Sarnak's paper and became 
interested in determining the "probabilities" for some of the three-way prime number 
races that Rubinstein and Sarnak's work did not address — for example, the race between 
the three contestants 

7^{primes 8n -|- 3 < x}, ^{primes 8n -|- 5 < a;} and ^{primes 8n -|- 7 < x}; 

note that none of these arithmetic progressions contain any squares. 

With colleague Audrey Feuerverger at the University of Toronto^"^, he created a tech- 
nique to determine the frequencies of the various orderings of contestants in a prime number 
race with more than two contestants. Unexpectedly they found that the six orderings of 
the three contestants in the race above do not necessarily each happen one sixth of the 
time. In fact 

^{primes 8n -|- 3 < x} > ^{primes 8n -|- 5 < x} > #|primes 8n -|- 7 < x} 

holds for approximately 19.2801% of integers x (in the logarithmic measure), while 

^{primes 8n -|- 5 < a;} > #{primes 8n -|- 3 < x} > #{primes 8n -|- 7 < a;} 

holds for approximately 14.0772% of integers x (under appropriate assumptions). 

This is strange, for in the race between primes of the form 8n -|- 3 and primes of the 
form 8n + 5 both contestants are in the lead half the time, yet if we only look at values of 
X for which both of these prime number counts are ahead of the count for primes of the 
form Sn + 7, then the 8n -|- 3 primes are more likely to be ahead! 

^"^In an email to the first-named author. 

^-"^ Fernando Rodriguez- Villegas, now of the University of Texas at Austin, but then a Princeton post- 
doctoral fellow who had lots of computational experience. 

Michael Rubinstein is now an assistant professor at the University of Waterloo. 
^^Where he was a postdoctoral fellow at the time before moving to the University of British Columbia. 
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More esoteric races. 

There are results known that give asymptotics for the number of primes having certain 
properties. For example, when is 2 a cube modulo the prime p? It is easy to see that this is 
the case for all primes of the form 3?i + 2, and so we focus on the other primes. It is known 
that 2 is a cube for one-third of the primes of the form 3n + 1, asymptotically. So we ask 
whether, typically, slightly more or slightly less than one-third of the primes p = 3n + 1 
have the property that 2 is a cube modulo p. 

In his recent doctoral thesis at the University of British Columbia, Nathan Ng^* noted 
that this question can be answered using similar techniques to Rubinstein and Sarnak, 
since the count of these primes depends in an analogous way on the zeros of yet another 
type of L-function. Indeed Ng observed that such results can be generalized to many cases 
in which we know the asymptotics for a set of primes that can be described by Galois 
theory^^. 

Ng gave the following nice example of his work. Power series like 

oo oo 
n=l n=l 

appear frequently in arithmetic geometry; this is an example of a modular form, and has 
all sorts of seemingly miraculous properties (see Serre's book [Se]). One such miracle is 
that for every prime p, the value of ap is 2, 0, or —1 and that the proportions of each are 
around 1/6, 1/2 and 1/3, respectively. Under the appropriate assumptions Ng showed, in 
the logarithmic measure, that 

2#{p <x,ap = 0}> 6#{p <x,ap = 2} for ^ 98.30% of the values of x, 
'^i^ip < x,ap = 0} > 3#{p < x,ap = —1} for ^ 72.46% of the values of x, and 
3#{p <x,ap = —1} > 6#{p <x,ap = 2} for f» 95.70% of the values of x. 

Prime pairs. 

The first author's involvement in this topic came from an invitation to speak at an MAA 
meeting in Montgomery, Alabama in 2001, which gave him an excuse to read up on this 
fascinating subject. A subsequent discussion with students led to the idea of creating a 
2001-2002 VIGRE^^ research group at the University of Georgia to investigate analogous 
questions about twin prime races: 

For any even number d, we believe that there are infinitely many integers n for which 
both n and n + d are prime. When d = 2, this is the famous Twin Prime Conjecture. 
At present, we cannot prove that there are infinitely many "prime pairs" n,n + d for any 

^'^Nathan Ng is now an assistant professor at the University of Ottawa. 

^^The L-functions in question are Artin L-functions, and the asymptotics for conjugacy classes under 
the appropriate Frobenius maps are obtained by Cebotarev's density theorem. Analogous results are 
proved under the assumption of holomorphy for these L-functions as well as the Riemann Hypothesis and 
linear independence over the rationals of the zeros of these L-functions. 

^^The latest acronym to come out of Washington, a new type of grant which encourages "Vertical 
Integration" of research, from bright undergraduates through senior faculty. 
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value of d, whatsoever. Nonetheless, we can predict how many there should be up to x. 
Indeed, we believe that 

^{prime pairs n,n + 2^ < x} 

-— r ^ — > 1 as a; — > OO 

#{prime pairs n,n + 2'^ < x} 

for any positive integers j and k. Comparing the first four powers of 2, we obtain the 
following data: 



n < X 


n, n + 2 


n,n + A 


n, n + 8 


n, n + 16 


100 


8 


9 


9 


9 


200 


15 


14 


14 


13 


500 


24 


27 


24 


24 


1000 


35 


41 


38 


39 


2000 


61 


65 


63 


60 


5000 


143 


141 


141 


135 


10000 


205 


203 


208 


200 


20K 


342 


344 


353 


331 


50K 


705 


693 


722 


707 


lOOK 


1224 


1216 


1260 


1233 


200K 


2160 


2136 


2194 


2138 


500K 


4565 


4559 


4641 


4631 


1 million 


8169 


8144 


8242 


8210 



Table 8. The number of prime pairs n, n + 2'^ up to a;, for A; = 1, 2, 3, 4 and 
various values of x. 

It appears from this initial data that there is a bias in favor of prime pairs n, n + 8, at 
least once x is ten thousand or more. However, there is no explicit formula like (3) known 
that would lead to a fruitful attack on this question. In the second appendix to this paper, 
we present the work of a team of students at the University of Georgia that sought further 
data and tried to make predictions. 

There are several other natural questions pertaining to prime number races that have 
been the subject of research over the last few years. 

It can take a long time to take the lead. 

Although Team 1 does occasionally get ahead in the Mod 4 Race, it takes quite a while 
for this to happen. Indeed x is larger than 25,000 the first time Team 1 takes the lead and 
then only momentarily; the first time Team 1 sustains the lead over a longer interval x is 
larger than half a million. Similarly, in the Mod 3 Race, x is larger than half a trillion 
before Team 1 takes the lead. In the 7r{x) versus Li(a;) race, we still do not know exactly 
when tt{x) first takes the lead though the answer is definitely gigantic. 

Another situation is where one Team is ganged up on by the other Teams. For instance 
we saw that primes of the form 8n + 1 lag far behind primes of the forms 8n + 3, 8n + 5 or 
8n + 7 (because all the odd squares, including squares of primes, are of the form 8n + 1, a 
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heavy burden on Team 1). In fact, the first time that Team 1 even gets out of last place is 
beyond half a billion: x = 588, 067, 889 is the first time that there are more primes of the 
form 8n + 1 up to X than there are primes of the form 8n + 5; then x ~ 1.9282 x lO"*^^ is 
the first time that there are more primes of the form 8?i + 1 up to x than there are primes 
of the form 8n + 7. Kaczorowski and, independently, Rubinstein and Sarnak showed that 
Team 1 runs in first place in this four-way race for a positive proportion of values of x; 
however, the first time this occurs is beyond 10^®. 

So why does it take so long for some competitors to get their turn in the lead? To 
understand this we need to examine the most "important" terms in formulas like (3), 
namely the "1" term and the first few summands, those for which 7 is small. Looking 
closely at the formula (3), these first few summands, which correspond to the nontrivial 
zeros of the Riemann zeta function that are closest to the real axis, have 7 approximately 
equal to 14.13, then 21.02, 25.01, 30.42, .... In particular, these summands are all < 1/7 
in absolute value, which is small relative to the "1" term. Even if we could find a value of 
X for which many of the initial terms sin(7lna;) are close to —1, it would take at least the 
first twenty-one terms of the sum to be negative enough to compensate for the initial "1" 
term. Thus these values of x must be special in that many of the sin(7lna;) terms must 
be close to —1 simultaneously. 

If we look at the Mod q Race as q gets larger, it turns out that the relevant numbers 7 
become smaller (that is, the zeros of the appropriate Dirichlet L-functions lie closer to the 
real axis), and therefore changes in the lead happen sooner and more frequently. 

However, tiny values of 7 come with their own problems. For example, when q = 163 
the relevant sum has an especially small 7, namely 7 ^ 0.2029. So this one summand 
has a large impact on the final answer since the denominator is so small. The first time 
sin(7lna;) is close to —1 is when 7 In a; is close to 37r/2; which corresponds to a value of 
X around 12 billion! And if that x doesn't work out because of the other terms then the 
next such value is when 7lna; is close to 77r/2, which corresponds to x ^ 3.43 x 10^^. No 
wonder it can take a while to see the predicted effects! 

Sharing the lead. 

In a race between two contestants who keep taking the lead from one another, there will 
be plenty of moments when the two contestants are tied: Since there are arbitrarily large 
values of x for which 7^{primes qn + a < x} > ^{primes qn + b < x}, and also arbitrarily 
large values for which 7^ {primes qn + a < x} < primes qn + b < x}, and since these 
counting functions take only integer values, there must be infinitely many integers x for 
which ^{primes qn + a < x} = #{primes qn + b < x}. 

What about races with three or more contestants? Even if each of the contestants leads 
at some point, there seems to be no particular reason for all three of them to be tied at 
any point. So we ask whether there exist infinitely many integers x for which 

#{primes qn + a < x} = #{primes qn + b < x} = #{primes qn + c < x} = ■ ■ ■ . 

Feuerverger and Martin conjecture that there are infinitely many such ties in a three-way 
Mod q Race, but not in a Mod q Race with four or more teams. Their compelling argument 
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runs as follows: Consider the {k — 1) -dimensional vector whose ith entry is the difference 

(5) #{primes qn + ai < x} — #{primes qn + a^+i < x} 

for each 1 < i < k — 1. Notice that the k counting functions ^{primes qn + ai < x} are 
all tied with one another precisely if this {k — 1) -dimensional vector equals (0,0, .. .,0). 
As we let x increase, this vector changes each time x equals a prime number in one of the 
arithmetic progressions we are counting, changed by adding one of the vectors 

(6) (1,0,..., 0), (-1,1,0,...,0), ■■■ , (0,..., 0, -1,1,0,... , 0), ■■■ , (0,...,0,-l), 

depending on whether the prime is of the form qn + ai or qn + a2 or ... qn + ak, respectively. 
The prime number theorem for arithmetic progressions tells us that each of these vectors 
is roughly equally likely to occur. 

From this, Feuer verger and Martin suggest that the progress of the {k — l)-dimensional 
vector in (5) can be modelled by a "random walk" on the [k — l)-dimensional lattice 
generated by the vectors listed in (6). Now it is known that with probability 1, a random 
walk on a one- or two-dimensional lattice will return to the origin (indeed, will visit every 
lattice point) infinitely often, but a random walk on a lattice of dimension three or greater 
will return to the origin only finitely often. This is the source of Feuerverger and Martin's 
conjecture, since the lattices of dimension one and two in this model correspond to prime 
number races with two and three contestants, respectively. 

Certain special symmetries. 

Assuming the Generalized Riemann Hypothesis, Feuerverger and Martin proved that 
certain configurations 

^{primes qn + ai < x} < ^{primes qn + a2 < x} < ■ ■ ■ < ^{primes qn + ar < x} 
occur just as often^^ as certain other configurations 

7^{primes qn + bi < x} < ^^{primes qn + b2 < x} < ■ ■ ■ < ^{primes qn + br < x} 
in the following situations: 

• The a's and 6's are inverses of each other modulo q; that is, aibi = 1 (mod q) for all i; 

• The list of 6's is just the reversal of the list of a's, namely bi = ar+i-i for all i, and are 
either all squares modulo q or all nonsquares modulo q; 

• There exists an integer m such that = mbi (mod q) for each i, and in addition, one 
of the following holds: 

• The a's are all squares modulo q; or 

• For each i, the two numbers and bi are either both squares modulo q or both 
nonsquares modulo q; or 



^^With respect to the logarithmic measure of the set of such values x. 
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• For each i, exactly one of and bi is a square modulo q. 

Probably the orderings appear with different frequencies if they are not related by some 
special symmetry. 

And what if the Riemann Hypothesis is false? 

Certain famous results in number theory have been proved in two parts: first under 
the assumption that the Riemann Hypothesis is true, and second under the assumption 
that the Riemann Hypothesis is false. The groundbreaking work of Sarnak and Rubinstein 
followed from two hypotheses, the first of which was the Generalized Riemann Hypothesis. 
One might ask whether we can obtain similar results, perhaps with a rather different proof, 
under the assumption that the Generalized Riemann Hypothesis is false. 

If the Generalized Riemann Hypothesis is false, it turns out to be easier to prove that 
^{primes qn + a < x} > #{primes qn + b < x} for a positive proportion^^ of x, since the 
"1" term in the formula analogous to (3) becomes irrelevant, and this was what caused the 
bias when we assumed the Generalized Riemann Hypothesis. However, whether one can 
prove that this happens 50% of the time is still in doubt. If one could prove this under 
the assumption that the Generalized Riemann Hypothesis is false, then we would have an 
unconditional proof that the Mod q Race between two squares or two nonsqiiares is always 
an even race (by combining such a proof with the work of Rubinstein and Sarnak) . 

The behavior of prime number races might be genuinely different if the Generalized Rie- 
mann Hypothesis does not hold. In 2001, Ford and Konyagin proved that if the Generalized 
Riemann Hypothesis is false — false in an absurdly convenient, yet feasible, way — then there 
are some orderings of the prime number counting functions that never occur. 

To describe one of their constructions, we need to discuss a generalization of the Riemann 
zeta-function. Define x to be that function for which 

x(5m) = 0, x(5"7. ± 1) = ±1, and x(5n7, ± 2) = ±z for all integers m; 

and let L(s,x) = X^n>i ^(^)/^*- (This is another example of a Dirichlet L-function.) 
Suppose that the Generalized Riemann Hypothesis is true except that L{s,x) has a single 
zero a + ij with cr > 1/2, 7 > 0, and that L(s, x) has a zero at cr — Z7 as well (zeros always 
come in conjugate pairs). The formula analogous to (3) is 

^{primes 5n + a < x} - ^^{x) ^ / cos(7logx) + z sin(7loga;) 

~ -zRe I Xl^j 



x'^/liix \ a + i'j 

for 1 < a < 4. If we work this out explicitly, we see that 

^ ^ #{primes 5n+l<x}- ^wix) . , n ■ ( ^ \ 

2(0" +7) T^j ~ — a"cos(7lnx) — 7sm(7lnx) 



1. 2 , 2^ #{prinies 5n+l< x}- |7r(a;) 
2^"" ^ x-/\nx 
1/9 9\ #{prinies 5n + 2 < x} — j7r{x) 

i((7 +7 ) 7^ 

2^ ^ x'^/lnx 



-\a +7 ) ~ a"sin(7lnx) — 7cos(7lnx) 
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i( 2^ 2x#{P™es 5n+3 <a;}- |7r(a;) 

Tj((J +7 I 7^; ~ — (TSin(7ma;) + 7COS(7ma;) 

2^ ' x"" /Inx 

\( 2 , 2^ #{prinies 5n + 4 < a;} - \k{x) 

o(c +7 ) n a; (Jcos(7ma;) + 7sm(7ma;) 

Amazingly, this implies that the configuration 
#{primes 5n+3 < a:} < #{primes 5n+2 < a;} < #{primes 5n+4 < < #{primes 5n+l < 
cannot occur when x is sufficiently large! Why? The first inequality implies that 

—o sin(7 In x) + 7 cos(7 In x) ^ cr sin(7 In x) — 7 cos(7 In x) 

<^=^ ^ (Tsin(7lnx) — 7COs(7lna:), 

where the ^ symbol means that the inequality holds up to an error that goes to zero as x 
tends to infinity. Similarly, the third inequality implies that a cos(7 In a;) +7 sin(7 In a;) ^ 0. 
The three-inequality configuration is therefore equivalent to 

^ crsin(7lna:) — 7 003(7 In a;) ~ cr 003(7 In a;) + 73in(7lna:) ^ 0, 

which implies both crsin(7lna:) — 7 003(7 In a:) ~ ^"^^ (1003(7 In a;) + 7sin(7lna:) ~ 
0. However, multiplying by sin(7lna;) and 003(7 In a;) respectively yields cr sin^ (7 In a;) — 
7COs(7lna;) sin(7lna;) and crcos^(7lna;) + 7COs(7lna;) sin(7lna;) 0, and adding 
these together gives 

cr = cr sin^ (7 In x) + cr cos^ (7 In a;) ~ 0. 

But this is ridiculous, as it says that the constant cr > i tends to as x goes to infinity. 

Ford and Konyagin proved more. For any three arithmetic progressions modulo there 
is a feasible (but equally unlikely) way to prescribe Generalized-Riemann-Hypothesis- 
violating zeros of the appropriate Dirichlet L-functions that would cause one of the six 
orderings of the three arithmetic progressions not to occur beyond a certain a:-value. More- 
over, the zeros in these configurations can be placed arbitrarily far from the real axis, which 
implies that there is no way for us to rule out such a possibility by a finite computation. 

It seems that their method can be extended to show that certain races between just two 
arithmetic progressions keep the same leader from some point onwards provided there is 
another technically feasible, but rather unlikely, contradiction to the Generalized Riemann 
Hypothesis. 

Other similar constructions appear in Ford and Konyagin's work. Moving up to r-way 
races with r > 4, they have constructed feasible configurations of "violations" which permit 
no more than of the r! possible orderings of r arithmetic progressions to occur infinitely 
often. 

It thus seems that it will be very difficult to obtain results independent of any unproved 
hypothesis. 

Acknowledgements. The section "Doing the wave" was inspired by Enrico Bombieri's de- 
licious article [Bo]. Thanks to Carter Bays, Kevin Ford, Richard Hudson and Nathan Ng 
for making available parts of their works that have not appeared in print. Thanks also to 
Guiliana Davidoff and Michael Guy for preparing the appendices. 
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Appendix One. A Mount Holyoke College REU. 
by Giuliana Davidoff 

The research group consisted of students Caroline Osowski, Jennifer vanden Eynden, Yi 
Wang, and Nancy Wrinkle. 

In my senior class in analytic number theory, I had been asking students to collect data 
on various pertinent questions so as to conjecture a forthcoming theorem, and then to 
report their results to the whole class. Having seen many examples where the theorems 
matched the initial experimental evidence, I asked them to collect data on the Mod 4 Race, 
and then surprised them with Littlewood's Theorem. Next we decided to investigate the 
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Mod 3, 5, 7, and 11 Races, in particular whether Team N, the set of primes that are 
not congruent to a square mod consistently leads over Team S, the set of primes that 
are congruent to a square mod q. The results found were intriguingly different from 
one modulus to another, so we decided to study this during the upcoming summer REU 
program. 

The students began by tracking down anything in the existing literature on sign changes 
and posting an appeal on a number theory web site; within days we had received a generous 
response from Andrew Odlyzko who pointed us to relevant data of Robert Rumely, allowing 
us to begin our own work in earnest. 

We were most fascinated by a 1971 paper of Harold Stark [27], in which he suggested a 
method to study prime races between primes from any two given arithmetic progressions. 
Back then, there was no general procedure known to prove that each team took the lead 
infinitely often in the race between the primes of the form gn + a and the primes of the 
form qn + 6, where a is a square mod q and h is not. Stark's results thus pertain to the 
first case not covered by the prior work of Littlewood [15] and of Knapowski and Turan 
[25, 26]. As he himself pointed out, it seemed particularly difficult to show qn + a leads 
infinitely often, even in the case of primes of the form 5n + 4 racing against primes of the 
form 5n + 2. 

Stark rephrases the problem in terms of two auxiliary functions (which are complicated 
expressions, analogous to (3), given in terms of zeros of various Dirichlet L- functions) 
and, in doing so, creates a beautiful setting in which he obtains a theorem from a nu- 
merical calculation. In this manner he was able to show that ^{primes 5n + 4 < x} > 
^{primes 5n + 2 < x} for arbitrarily large x, assuming the Generalized Riemann Hy- 
pothesis. In fact he proved this with a weaker assumption than the Generalized Riemann 
Hypothesis: One needs only that the Dirichlet L- functions associated to this race have no 
real zeros in the interval (1/2, 1). 

The REU group began by making a detailed numerical study of Stark's formula (for his 
Mod 5 Race), in order to appreciate how his complicated expressions vary with the actual 
numbers of primes: we found that there are very good correlations, though a little less 
so when we use fewer zeros to approximate the formula analogousto the right-hand side 
of (3). 

Using Stark's results and our own numerical calculations, we were able to prove the 
by-now expected result in the first cases not treated by him: 

Theorem [23]. Assume that Dirichlet L-functions have no real zeros in the interval 
(1/2,1). For a = 1, 2, or 4 (the squares mod q) and 6 = 3, 5, or 6 (the non-squares 
mod q), there are arbitrarily large values of x for which ^{primes 7n + a < x} > 
jj^{primes 7n-\-h < x} . In fact there exists a constant c > such that ^{primes 7n-\- a < 
x} > ^{primes 7n + b < x} > c^fxj logo;. 

The proof of the theorem is based on calculations of Stark's formula using available 
tables of zeros of Dirichlet L-functions. We could have perhaps gone on to settle this 
question about races, one prime modulus at a time. 

However, the question that initially interested us was whether Team S takes the lead 
over Team N for arbitrarily large x. To study this we had to appropriately modify Stark's 
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formulas: this time we found that the formulas only involve the zeros of the Dirichlet 
L-function 



where {n/q) = 1 if n is a square mod (n/q) = if q divides n, and (n/q) = — 1 if n is 
not a square mod q. 

We found the same remarkable correlations between the actual count of primes and our 
approximation; had the summer not ended, we would surely have proved that the lead 
changes hands infinitely often in this Mod 7 race. 

I was later able to go on and prove [23] that the lead changes hands infinitely often in the 
Team S vs. Team N race, for any modulus g, assuming a weak version of the Generalized 
Riemann Hypothesis: One only needs to assume that any real zero of the above Dirichlet 
L-function lies to the left of the least upper bound of the real parts of the complex zeros 
(which holds, in particular, when the Dirichlet L-function has no real zeros). 
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Appendix Two. A University of Georgia VIGRE research group 



The research group consisted of students Michael Beck, Zubeyir Cinkir, Michael Guy, Brian 
Lawlcr, Eric Pine, Paul Pollack, and Charles Pooh, with postdoctoral mentor Jim Solazzo. 

The twin prime problem, that there are infinitely many prime pairs p,p -\-2, is one of 
the most famous unsolved problems of classical number theory. There are many general- 
izations, but here we shall focus on the question of whether there are infinitely many prime 
pairs p,p-\-2k for any positive even integer 2k. The following conjecture as to how many 
such pairs there are up to x is due, essentially, to Hardy and Littlewood [10]. 

The Hardy-Littlewood conjecture. Let k be a positive integer, and let 7r2k{x) be the 
number of prime pairs p,p-\-2k with p < x. Then 




n>l 



References 



by Michael Guy. 



7r2k{x) - 2C2 n 



p — 1 



Li2{x), 



p\k, p>2 



p-2 



where 
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We decided to investigate this problem by collecting and analyzing data. Our program 
found pairs of primes using a modified Eratosthenes' sieve and then counted them. Some 
of the initial data is collected in the following table: 



X 






MX) 


7r8(^) 


TTiofX) 


103 


35 


41 


74 


38 


51 


10^ 


205 


203 


411 


208 


270 


10^ 


1,224 


1,216 


2,447 


1,260 


1,624 


10^ 


8,169 


8,144 


16, 386 


8,242 


10, 934 


10^ 


58,980 


58,622 


117, 207 


58, 595 


78,211 


108 


440, 312 


440, 258 


879, 908 


439, 908 


586,811 


10^ 


3,424,506 


3,424,680 


6, 849, 047 


3,426,124 


4,567,691 


lO^o 


27,412,679 


27,409,999 


54,818,296 


27,411,508 


36, 548, 839 


10" 


224, 376, 048 


224,373, 161 


448, 725, 003 


224,365,334 


299, 140, 330 


10^2 


1,870,585,220 


1,870,585,459 


3,741,217,498 


1,870,580,394 


2,494,056,601 



Table 9. 7r2fe(a;) is the number of prime pairs p,p + 2/c with p < x. 

Notice that the counts for 2fc = 2, 4 and 8 are very close. We graphed the data for each 
2k < 30, noticing that TT2k{x) and Ti2t{x) are close if the prime factors of 2k and of 21 
are the same. Indeed, Hardy and Littlewood's conjecture predicts that 'n2k{x) ~ n2£{x) in 
these circumstances, and so we felt it would be interesting to study this "twin-prime race". 

If we wish to compare 7r2{x) and 7rQ{x) then it makes sense to "renormalize" so that the 
predictions of Hardy and Littlewood are the same for both. In other words, the conjecture 
predicts that ^ttq{x) should be approximately 7r2{x), so it makes sense to compare these 
two quantities. More generally, we define 

7r^fc(X) = 7r2fc(X) • Yl for each k>l, and = 2C2 • U2{X). 

p\k, p>2^ 

Hardy and Littlewood's conjecture predicts that 7r2^(X) ~ 7r//i,(X) for all k, so below we 
have tabulated their difference. 



X 




'^'2 - ''^HL 


- T^HL 




7I"8 - T^HL 




103 


45 


-10 


-4 


-8 


-7 


-7 


10^ 


214 


-9 


-11 


-9 


-6 


-12 


10^ 


1,248 


-24 


-32 


-25 


12 


-30 


10^ 


8,248 


-79 


-104 


-55 


-6 


-48 


10^ 


58, 753 


227 


-131 


-150 


-158 


-95 


108 


440, 367 


-55 


-109 


-413 


-459 


-259 


10^ 


3,425,308 


-802 


-628 


-785 


816 


460 


lO^o 


27,411,416 


1,263 


-1,417 


-2,268 


92 


213 


10" 


224, 368, 866 


7,182 


4,295 


-6,365 


-3,532 


-13,619 


10^2 


1,870,559,881 


25, 339 


25, 578 


48, 868 


20,513 


-17,430 
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Table 10. The renormalized twin prime race. 

What a remarkable fit! It looks like \7r2k{x) — 7rHL{x)\ is usually well less than ^/x. In 
fact we collected data for A; = 1, 2, . . . , 50 in this range of x, and our prediction seemed to 
always be very good. 

However, this is a paper about prime races after all, and we wanted to further investigate 
whether there are any particular winners, or losers, to this race. We instructed our program 
to take note of when there was a change in first or last position and to give us this 
information as well. 

At the beginning of our investigation, we seemed to have spotted what we thought were 
winners and losers in this prime race. It appeared that the pairs p,p + 60 and p,p + 80 
were ahead of the other pairs with the same prime divisors, based on data up to 5 • 10^. 
However, after counting up to lO-*^^, they were no longer regularly the winners. Similarly 
several prime pairs were consistently the losers at the start of the prime twin race, but 
after a while there were no consistent losers. As an interesting tidbit, our program noted 
that there were 14,455 changes in first place between 10'^ and 10^! 

So in the end, this appears to be a race in which there are no particular winners or 
losers, and still lots of unanswered questions. 



